9 research outputs found

    A Lightweight Regression Method to Infer Psycholinguistic Properties for Brazilian Portuguese

    Full text link
    Psycholinguistic properties of words have been used in various approaches to Natural Language Processing tasks, such as text simplification and readability assessment. Most of these properties are subjective, involving costly and time-consuming surveys to be gathered. Recent approaches use the limited datasets of psycholinguistic properties to extend them automatically to large lexicons. However, some of the resources used by such approaches are not available to most languages. This study presents a method to infer psycholinguistic properties for Brazilian Portuguese (BP) using regressors built with a light set of features usually available for less resourced languages: word length, frequency lists, lexical databases composed of school dictionaries and word embedding models. The correlations between the properties inferred are close to those obtained by related works. The resulting resource contains 26,874 words in BP annotated with concreteness, age of acquisition, imageability and subjective frequency.Comment: Paper accepted for TSD201

    Genomic–transcriptomic evolution in lung cancer and metastasis

    Get PDF
    Intratumour heterogeneity (ITH) fuels lung cancer evolution, which leads to immune evasion and resistance to therapy. Here, using paired whole-exome and RNA sequencing data, we investigate intratumour transcriptomic diversity in 354 non-small cell lung cancer tumours from 347 out of the first 421 patients prospectively recruited into the TRACERx study. Analyses of 947 tumour regions, representing both primary and metastatic disease, alongside 96 tumour-adjacent normal tissue samples implicate the transcriptome as a major source of phenotypic variation. Gene expression levels and ITH relate to patterns of positive and negative selection during tumour evolution. We observe frequent copy number-independent allele-specific expression that is linked to epigenomic dysfunction. Allele-specific expression can also result in genomic–transcriptomic parallel evolution, which converges on cancer gene disruption. We extract signatures of RNA single-base substitutions and link their aetiology to the activity of the RNA-editing enzymes ADAR and APOBEC3A, thereby revealing otherwise undetected ongoing APOBEC activity in tumours. Characterizing the transcriptomes of primary–metastatic tumour pairs, we combine multiple machine-learning approaches that leverage genomic and transcriptomic variables to link metastasis-seeding potential to the evolutionary context of mutations and increased proliferation within primary tumour regions. These results highlight the interplay between the genome and transcriptome in influencing ITH, lung cancer evolution and metastasis

    Evolutionary characterization of lung adenocarcinoma morphology in TRACERx

    No full text
    Lung adenocarcinomas (LUADs) display a broad histological spectrum from low-grade lepidic tumors through to mid-grade acinar and papillary and high-grade solid, cribriform and micropapillary tumors. How morphology reflects tumor evolution and disease progression is poorly understood. Whole-exome sequencing data generated from 805 primary tumor regions and 121 paired metastatic samples across 248 LUADs from the TRACERx 421 cohort, together with RNA-sequencing data from 463 primary tumor regions, were integrated with detailed whole-tumor and regional histopathological analysis. Tumors with predominantly high-grade patterns showed increased chromosomal complexity, with higher burden of loss of heterozygosity and subclonal somatic copy number alterations. Individual regions in predominantly high-grade pattern tumors exhibited higher proliferation and lower clonal diversity, potentially reflecting large recent subclonal expansions. Co-occurrence of truncal loss of chromosomes 3p and 3q was enriched in predominantly low-/mid-grade tumors, while purely undifferentiated solid-pattern tumors had a higher frequency of truncal arm or focal 3q gains and SMARCA4 gene alterations compared with mixed-pattern tumors with a solid component, suggesting distinct evolutionary trajectories. Clonal evolution analysis revealed that tumors tend to evolve toward higher-grade patterns. The presence of micropapillary pattern and ‘tumor spread through air spaces’ were associated with intrathoracic recurrence, in contrast to the presence of solid/cribriform patterns, necrosis and preoperative circulating tumor DNA detection, which were associated with extra-thoracic recurrence. These data provide insights into the relationship between LUAD morphology, the underlying evolutionary genomic landscape, and clinical and anatomical relapse risk

    Body composition and lung cancer-associated cachexia in TRACERx

    No full text
    Cancer-associated cachexia (CAC) is a major contributor to morbidity and mortality in individuals with non-small cell lung cancer. Key features of CAC include alterations in body composition and body weight. Here, we explore the association between body composition and body weight with survival and delineate potential biological processes and mediators that contribute to the development of CAC. Computed tomography-based body composition analysis of 651 individuals in the TRACERx (TRAcking non-small cell lung Cancer Evolution through therapy (Rx)) study suggested that individuals in the bottom 20th percentile of the distribution of skeletal muscle or adipose tissue area at the time of lung cancer diagnosis, had significantly shorter lung cancer-specific survival and overall survival. This finding was validated in 420 individuals in the independent Boston Lung Cancer Study. Individuals classified as having developed CAC according to one or more features at relapse encompassing loss of adipose or muscle tissue, or body mass index-adjusted weight loss were found to have distinct tumor genomic and transcriptomic profiles compared with individuals who did not develop such features. Primary non-small cell lung cancers from individuals who developed CAC were characterized by enrichment of inflammatory signaling and epithelial–mesenchymal transitional pathways, and differentially expressed genes upregulated in these tumors included cancer-testis antigen MAGEA6 and matrix metalloproteinases, such as ADAMTS3. In an exploratory proteomic analysis of circulating putative mediators of cachexia performed in a subset of 110 individuals from TRACERx, a significant association between circulating GDF15 and loss of body weight, skeletal muscle and adipose tissue was identified at relapse, supporting the potential therapeutic relevance of targeting GDF15 in the management of CAC

    Lung adenocarcinoma promotion by air pollutants

    No full text
    A complete understanding of how exposure to environmental substances promotes cancer formation is lacking. More than 70 years ago, tumorigenesis was proposed to occur in a two-step process: an initiating step that induces mutations in healthy cells, followed by a promoter step that triggers cancer development1. Here we propose that environmental particulate matter measuring ≤2.5 μm (PM2.5), known to be associated with lung cancer risk, promotes lung cancer by acting on cells that harbour pre-existing oncogenic mutations in healthy lung tissue. Focusing on EGFR-driven lung cancer, which is more common in never-smokers or light smokers, we found a significant association between PM2.5 levels and the incidence of lung cancer for 32,957 EGFR-driven lung cancer cases in four within-country cohorts. Functional mouse models revealed that air pollutants cause an influx of macrophages into the lung and release of interleukin-1β. This process results in a progenitor-like cell state within EGFR mutant lung alveolar type II epithelial cells that fuels tumorigenesis. Ultradeep mutational profiling of histologically normal lung tissue from 295 individuals across 3 clinical cohorts revealed oncogenic EGFR and KRAS driver mutations in 18% and 53% of healthy tissue samples, respectively. These findings collectively support a tumour-promoting role for PM2.5 air pollutants and provide impetus for public health policy initiatives to address air pollution to reduce disease burden

    Guidelines for the use and interpretation of assays for monitoring autophagy (4th edition)

    No full text
    In 2008, we published the first set of guidelines for standardizing research in autophagy. Since then, this topic has received increasing attention, and many scientists have entered the field. Our knowledge base and relevant new technologies have also been expanding. Thus, it is important to formulate on a regular basis updated guidelines for monitoring autophagy in different organisms. Despite numerous reviews, there continues to be confusion regarding acceptable methods to evaluate autophagy, especially in multicellular eukaryotes. Here, we present a set of guidelines for investigators to select and interpret methods to examine autophagy and related processes, and for reviewers to provide realistic and reasonable critiques of reports that are focused on these processes. These guidelines are not meant to be a dogmatic set of rules, because the appropriateness of any assay largely depends on the question being asked and the system being used. Moreover, no individual assay is perfect for every situation, calling for the use of multiple techniques to properly monitor autophagy in each experimental setting. Finally, several core components of the autophagy machinery have been implicated in distinct autophagic processes (canonical and noncanonical autophagy), implying that genetic approaches to block autophagy should rely on targeting two or more autophagy-related genes that ideally participate in distinct steps of the pathway. Along similar lines, because multiple proteins involved in autophagy also regulate other cellular pathways including apoptosis, not all of them can be used as a specific marker for bona fide autophagic responses. Here, we critically discuss current methods of assessing autophagy and the information they can, or cannot, provide. Our ultimate goal is to encourage intellectual and technical innovation in the field

    Annuaire 2010-2011

    No full text
    corecore